ASR for emotional speech: Clarifying the issues and enhancing performance
Authors
Abstract
There are multiple reasons to expect that recognising the verbal content of emotional speech will be a difficult problem, and recognition rates reported in the literature are in fact low. Including information about prosody improves recognition rate for emotions simulated by actors, but its relevance to the freer patterns of spontaneous speech is unproven. This paper shows that recognition rate for spontaneous emotionally coloured speech can be improved by using a language model based on increased representation of emotional utterances. The models are derived by adapting an already existing corpus, the British National Corpus (BNC). An emotional lexicon is used to identify emotionally coloured words, and sentences containing these words are recombined with the BNC to form a corpus with a raised proportion of emotional material. Using a language model based on that technique improves recognition rate by about 20%.
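The corpus-enrichment step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the tokenizer, the function names (`is_emotional`, `enrich_corpus`, `emotional_share`), the `extra_copies` parameter, and the toy sentences and lexicon are all assumptions made for illustration. The abstract does not say how the emotional sentences are recombined with the BNC, so simple duplication is used here as one plausible way to raise their proportion.

```python
import re


def tokenize(sentence):
    """Lowercase a sentence and split it into simple word tokens."""
    return re.findall(r"[a-z']+", sentence.lower())


def is_emotional(sentence, lexicon):
    """True if the sentence contains at least one word from the lexicon."""
    return any(token in lexicon for token in tokenize(sentence))


def enrich_corpus(sentences, lexicon, extra_copies=1):
    """Return the base corpus plus extra copies of its emotional sentences.

    Duplication is an assumed recombination scheme, chosen only to
    illustrate raising the proportion of emotional material.
    """
    emotional = [s for s in sentences if is_emotional(s, lexicon)]
    return list(sentences) + emotional * extra_copies


def emotional_share(sentences, lexicon):
    """Fraction of sentences that contain a lexicon word."""
    if not sentences:
        return 0.0
    return sum(is_emotional(s, lexicon) for s in sentences) / len(sentences)


# Toy stand-ins for the BNC and the emotional lexicon (hypothetical data).
base_corpus = [
    "I am so happy today",
    "The train leaves at noon",
    "That was a terrible mistake",
    "Please close the door",
]
emotion_lexicon = {"happy", "terrible"}

enriched_corpus = enrich_corpus(base_corpus, emotion_lexicon, extra_copies=2)
```

An n-gram language model would then be trained on `enriched_corpus` instead of `base_corpus`; comparing `emotional_share` before and after shows how far the proportion of emotional material has been raised.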
Similar resources
On the Relationship between Emotional Intelligence and Directive Speech Acts Preference
Language and emotion are two related systems in use, in that one system (emotions) impacts the performance of the other (language). Both of them share their functionality in communication. Since the nature of foreign language classrooms is ideally interactional, emotional intelligence (EI) gains importance. The aim of this study was to find out whether one's total emotional quotient and its com...
Helpful Statistics in Recognizing Basic Arabic Phonemes
The recognition of continuous speech is one of the main challenges in building automatic speech recognition (ASR) systems, especially for phonetically complex languages such as Arabic. ASR systems seem to have reached a dead end: nearly all solutions follow the same general model. Previous research focused on enhancing its performance by incorporating supplemen...
Towards Robust Spontaneous Speech Recognition with Emotional Speech Adapted Acoustic Models
In addition to linguistic information, the speech signal carries information about the speaker: age, gender, social status, accent (foreign accent, dialects, etc.), emotional state, health, etc. Some of these informational channels induce changes in the acoustic characteristics of speech. This article presents an evaluation of ASR acoustic models (first trained on neutral, read speech) ...
An Examination of the Impact of Customer Relationship Management on Marketing Performance by Clarifying Mediating Role of Innovation and Marketing Memory
The aim of this study is to survey the effect of customer relationship management on marketing performance with regard to the mediating role of innovation and marketing memory in the insurance authority of Kerman province. The population of this research is the managers and staff of insurance companies in Kerman province, and the sample amounted to 252, estimated by a relative random method and Cochran...
Improving the performance of MFCC for Persian robust speech recognition
Mel-frequency cepstral coefficients (MFCCs) are the most widely used features in speech recognition, but they are very sensitive to noise. In this paper, to achieve satisfactory performance in Automatic Speech Recognition (ASR) applications, we introduce a new noise-robust set of MFCC vectors estimated through the following steps. First, spectral mean normalization is a pre-processing step which applies to t...
Journal title:
- Neural Networks: the official journal of the International Neural Network Society
Volume 18, Issue 4
Pages: -
Publication date: 2005